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Abstract: The identification of hadronically decaying heavy states, such as vector 

bosons, the Higgs, or the top quark, produced with large transverse boosts has been and 
will continue to be a central focus of the jet physics program at the Large Hadron Collider 
(LHC). At a future hadron collider working at an order-of-magnitude larger energy than the 
LHC, these heavy states would be easily produced with transverse boosts of several TeV. 
At these energies, their decay products will be separated by angular scales comparable to 
individual calorimeter cells, making the current jet substructure identification techniques 
for hadronic decay modes not directly employable. In addition, at the high energy and 
luminosity projected at a future hadron collider, there will be numerous sources for con¬ 
tamination including initial- and final-state radiation, underlying event, or pile-up which 
must be mitigated. We propose a simple strategy to tag such “hyper-boosted” objects 
that dehnes jets with radii that scale inversely proportional to their transverse boost and 
combines the standard calorimetric information with charged track-based observables. By 
means of a fast detector simulation, we apply it to top quark identification and demonstrate 
that our method efficiently discriminates hadronically decaying top quarks from light QCD 
jets up to transverse boosts of 20 TeV. Our results open the way to tagging heavy objects 
with energies in the multi-TeV range at present and future hadron colliders. 
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1 Introduction 

The discovery of the Higgs boson [1, 2] represents a landmark in the exploration of the 
electroweak scale started with Run I of the LHC. The forthcoming Run H will extend the 
energy frontier for the direct searches further into the multi-TeV range, providing the hrst 
data that will test the up-to-now complete Standard Model (SM) to higher energies and to 
higher degree of precision. Among the best probes available to study the electroweak scale 
and the underlying mechanism responsible for its breaking are the vector bosons, the Higgs, 
and the top quark. Because of their large masses with respect to the electroweak scale, 
this implies a “strong” interaction with the Higgs held and therefore possibly enhanced 
couplings to new resonances and a higher sensitivity to deviations from SM predictions. In 
particular, the heavy states in the SM are responsible for large radiative corrections to the 
Higgs mass leading to the hierarchy problem. New physics models that solve the hierar¬ 
chy problem and feature states at multi-TeV scales therefore will presumably signihcantly 
couple to heavy bosons and the top quark. 

This sensitivity to new physics makes the efficient identihcation of vector bosons, the 
Higgs, and the top quark an important goal for the LHC and future colliders. Two simple 
factors need to be taken into account in the quest for the optimal tagging strategies. The 
first is that the largest decay rates of all the electroweak heavy states of the SM are into 
hadronic final states, producing multiple jets. For instance, the top quark features a two- 
body (semi-weak) decay into a b quark and a W boson, the latter decaying to a jet pair 
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around 67% of the time. For the Higgs boson the branching ratio into jets approaches 
80%. The inability to select and identify such objects when decaying hadronically would 
therefore severely limit the final statistics available for physics studies. The second factor 
is that all methods for identification and the relevant backgrounds are highly sensitive to 
the pt of the heavy state. For example, top quarks produced near threshold decay to 
widely-separated and relatively low px jets, with dominant backgrounds from multi-jet 
production in QCD. This low-pT regime presents significant challenges, both because QCD 
backgrounds can be enormous and because the combinatorics of determining the set of jets 
which came from a single top quark decay can be very inefficient. The same argument 
applies to the heavy bosons of the SM. 

At moderate boosts, with pT up to a few times the heavy state mass, the decay products 
begin to merge, and become collimated in the detector. In this regime, combinatoric 
ambiguities and QCD backgrounds are greatly reduced. All jets produced from the decay of 
a heavy state can be clustered into the same jet, with a large jet radius, and so backgrounds 
are limited to high-py jets in QCD which have masses around that of the heavy state. 
Further, one can study the substructure of the fat jets to identify prongy structure that 
would be a signature of a decay, and be highly unlikely in a QCD jet. With this motivation, 
the past several years have seen substantial development and implementation of observables 
for identifying boosted hadronically decaying heavy objects produced at the LHC [3-5]. 
Several of the most powerful techniques have been validated on data [6-28] and are now 
standard tools for jet analysis at ATLAS and CMS. 

Both because of the relatively low boosts and the fine granularity of the detectors, 
techniques for identifying heavy states at the LHC have impressive performance by ex¬ 
ploiting all of the radiation from an event. However, looking forward to Run H of the LHC 
and beyond, many of the techniques used thus far will be significantly limited for several 
reasons. First, at higher luminosities there are more secondary proton collisions (pile-up) 
that contaminate the primary hard collision event. Tracking can be used to identify the 
charged particles that originated from the hard collision, but observables that depend on 
calorimetry will be significantly degraded by pile-up contamination. Grooming the jet [29- 
39] to remove that radiation that most likely came from pile-up will be required. Also, 
as the centre-of-mass energy of the LHC increases, more heavy bosons and top quarks at 
higher transverse momenta will be created. With a transverse momentum of a few TeV, 
the decay products begin to merge into a single calorimeter cell with the current resolu¬ 
tion of ATLAS and CMS. At higher transverse momenta, unless the angular granularity 
increases significantly, using the calorimeter for identification of weak-scale particles will be 
essentially impossible. In this “hyper-boosted” regime, where pts approach and exceed 10 
TeV, it may be possible to access multi-boson/multi-top signatures such as VVV, tiH/V, 
four top quarks, or even more exotic final states. Such final states produce huge numbers 
of low Pt jets and it would to be extremely difficult to reconstruct the individual heavy 
states, separate them from background, or resolve combinatorics. 

With the recent excitement for a long-term goal of a 100 TeV future circular collider 
(FCC) [40-42], both of these issues become more acute. For example, at such high collision 
energies, weak bosons and the top quark could be produced with transverse momenta of 
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Cross section at pp, ^/s = 100 TeV 


Process 

Px > 1 TeV 

Px > 5 TeV 

Px > 10 TeV 

(Pb) 

(fb) 

(ab) 


pp —7- tt 

12 

2.8 

24 


PP tij 

52 

14 

94 


pp tj 

0.67 

0.46 

0.76 

^ zn 

pp —)• tiV 

0.40 

0.30 

3.7 

T) 

pp —)• ttH 

0.19 

7.4e-02 

0.65 

cC 

cd "d 

bJO 

pp —)• ttti 

0.17 

8.5e-02 

0.51 

pp jj 

3500 

1000 

11000 

CD ^ 

pp jjV 

110 

130 

2200 


pp ^ Z' ^ ti {mz' = 3 TeV) 

4.6 

- 

- 

pp ^ Z' ^ ti {mz' = 15 TeV) 

7.1e-03 

4.7 

- 

^ pp ^ Z' ^ ti {mz' = 30 TeV) 

7.1 e-05 

6.5e-02 

48 

^ pp ^ it ^ ti + $x ~ 1 TeV) 

0.49 

7.8e-03 

- 

pp ^ it ^ ti + $x = b TeV) 

7.5e-04 

0.063 

- 

pp ^ ii ^ ti + $x ("if = 10 TeV) 

4.4e-06 

0.27e-03 

0.024 

PP ^ 99 ^ titi + $T = 2 TeV) 

2.5 

0.94 

- 

PP ^ 99 ^ iiii + = 5 TeV) 

2.7e-02 

1.5 

11 

PP ^ 99 ^ titi + (i^g = 10 TeV) 

1.9e-04 

0.12 

4.5 


Table 1: Inclusive leading-order cross sections for Standard Model and beyond-Standard 
Model (BR = 1) processes with at least one top quark with px > 1, 5, 10 TeV at a 100 
TeV future collider. For the Standard Model backgrounds, the momentum requirement is 
imposed on one of the jets. Omitted entries have cross sections which are too small to be 
relevant. 


order 10 TeV, with decay products separated by angles much smaller than the resolution 
of any foreseeable (sufficiently compact) calorimeter. To be more concrete, in Table 1, 
we collect representative inclusive cross sections for several SM and beyond-SM (BSM) 
processes at a 100 TeV collider calculated to leading order. With very strong px cuts on 
jets in the final state, backgrounds of jets from QCD can be dramatically reduced, while 
probing heavy mass resonances. For example, the decay of a SM-like Z' of 15 TeV to 
top quarks (assuming a branching fraction equal to 1) would produce about 5 events per 
fb“^ at 100 TeV with px > 5 TeV, while the dominant background, from pp — )• jj events, 
produces about 1000 events per fb“^. Further predictions at next-to-leading order in QCD 
for cross sections with final state W, Z, H bosons and top quarks can found in Ref. [43] . 

The 100 TeV collider is projected to run at luminosities that are orders of magnitude 
greater than the LHC and so pileup will be a huge issue and grooming techniques will 
need to be understood at transverse momenta well beyond their current range of validity. 
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In addition, even resolving these issues, for a QCD jet with large transverse momentum, 
a perturbative emission can affect the jet mass by hundreds of GeV, comparable to the 
electroweak scale. The mass of a jet of radius i? ~ 1 and transverse momentum pT is 
approximately given by 

m'^, ( 1 . 1 ) 

where is the transverse momentum of a soft emission clustered in the jet. For example, 
at transverse momenta of pr ^ 6 TeV, an emission of only = 5 GeV would contribute 
to the jet mass an amount greater than the mass of the top quark. 

Therefore, to push identification of electroweak-scale states to higher and higher en¬ 
ergies requires controlling all of these effects. In this paper, we propose a new method 
as a robust and relatively pr-independent procedure for identifying heavy SM states at 
very high boosts produced at a future collider and apply it to the case of the top quark. 
Our approach to boosted object discrimination at very high pT consists of combining three 
elements: 

I. Global-jet calorimetric information 

We are interested in the extreme limit where the energy deposit of a jet is confined 
to a single, or only a few, calorimetric cells. We assume that the information on the 
total energy of the jet (possibly determined also using the track information as in a 
particle-flow algorithm [44]) is available, but detailed information about the energy 
distribution inside the jet is not. 

II. Inner-jet charged track information 

We exploit the high angular resolution of the tracking system to define observables 
sensitive to the internal structure of the jet only using charged particle tracks. 

III. Dynamic contamination removal 

Contamination from initial state radiation (ISR), underlying event (UE) and pile-up 
is proportional to area of the jet. In addition for a coloured particle such as the top 
quark, for a fixed jet radius more perturbative QCD radiation is emitted as the pT 
increases and is collected by the jet algorithm. Such an effect degrades the accuracy 
of the jet mass reconstruction and therefore the efficiency of tagging the boosted 
object. On the other hand, the typical angular size of a jet generated by a particle 
of mass m scales as m/pT, i.e., inversely proportional to the transverse momentum. 
To mitigate all these effects we propose use of a jet radius that scales inversely with 
the jet pt- This is similar to the variable R jet algorithm [45], with the important 
difference that in our method the clustering metric is not modified. 

To develop and test our identification strategy, we employ Delphes [46, 47], a modular 
and fast detector simulation framework. With Delphes, we are able to make qualitative, 
and (thanks to especially designed modifications/improvements) to some extent also quan¬ 
titative statements about finite resolution effects on the final discrimination power. 

This paper is organised as follows. In Sec. 2, we introduce the methods and observables 
used for the identification of heavy state jets at very high px explicitly applied to the case 
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of a top quark. In addition to identifying top quarks through the jet invariant mass 
reconstruction, we measure A^-subjettiness and energy correlation function observables to 
identify the substructure characteristic of a top quark decay. In Sec. 3, we apply our 
top quark finding procedure to a single jet pT range, presenting a detailed analysis and 
comparison between LHC detectors and projected performance of detectors in a future 
collider. We also show that, even in the hyper-boosted regime, top quark identification 
efficiencies can be 40%, while rejecting over 90% of the dominant background. We draw our 
conclusions and discuss the outlook of tagging hyper-boosted objects in Sec. 4. Appendices 
provide details of the DELPHESfast detector simulation used throughout this paper and 
plots of top tagging results for a wide range of jet pT- 

2 Observables and methodology 

In this section, we introduce the methods and observables that we will use throughout this 
paper to identify boosted hadronically decaying states. While the approach is quite general 
and, as it will be clear in the following, could be applied to tag other heavy states such as 
colour-singlet bosons, in this work we will focus on top quark identification. 

2.1 Jet finding and definition 

For a given jet radius R, the angular size of the top quark decay products scales as mtop/pT 
and therefore significantly shrinks as the pT increases. On the other hand even in the 
absence of pile-up, the amount of radiation in the event, both from the initial and final 
state, generically increases with the jet pT- Our aim will be to show that mitigating both 
initial- and final-state radiation effects can be efficiently achieved by dynamically scaling 
the jet radius by the jet pT- 

Because the top quark is unstable, final-state radiation (FSR) arises from two possible 
sources: either before it decays, or from the the daughter bottom quark after the decay. ^ 
Ideally, for an optimal invariant mass reconstruction, the former type of radiation should 
not be clustered into the top quark jet while the latter should. At low pt^, radiation from 
top quark is suppressed, as it must vanish when the top is at rest. This is a general property 
of QCD (and QED) radiation from any massive particle that often phrased in terms of a 
dead cone: radiation about the direction of motion of the top quark is suppressed in a 
cone of angular size mtop/pT', see Fig. 1. As the momentum of the top quark increases, the 
dead-cone shrinks and more phase space opens up for radiation from the top quark. 

The contribution from ISR (or UE) to the pT of the jet scales like R? (proportional 
to the area of the jet) while the effect on the squared mass scales like R^ because ISR is 
approximately uncorrelated with final state jets. At very high transverse momenta, ISR will 

^We remind the reader that the notion of initial- and hnal-state radiation and that of associating emis¬ 
sions to individual coloured particles is an approximation valid only in the collinear limit in the narrow- 
width approximation. For the top quark, the notion of radiation before or after its decay is meaningful 
when l/i?rad — Trad < Uop = l/Ftop, where Frad is the characteristic energy scale of the radiation. By 
contrast, for a colour-singlet like the W boson, radiation from its decay products are localised about the 
direction of the W, in a region scaling like the characteristic angular size of the decay, mw Ipt- 
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Figure 1: Illustration of the dead-cone effect. Final-state radiation from a top quark is 
suppressed within an angular region scaling like i?d.c. ~ w-top/PT; called the dead-cone 
region. As the pT of the top quark increases, the amount of radiation emitted from the 
top quark in a fixed angular region increases both due to the increase in energy and the 
decrease in size of the dead-cone region. 


only distort the pT spectrum of the jets slightly, but can affect the jet mass substantially. 
To see this behavior, consider a top quark jet of radius R with a single emission near the 
boundary of the jet. As long as the energy of this emission is small with respect to the 
energy of the top, its effect on the jet mass m is approximately given by 

- w-top + PT pf^R^ . (2.1) 


Therefore, an ISR emission in the jet can contribute a mass comparable to the mass of the 
top quark when 


plSR^ 


m. 


top 


PtR"^ 


( 2 . 2 ) 


At moderate boosts, typical of the LHC, where pr ~ few x mtop, ISR of 50 GeV or so can 
affect the jet mass by an 0(1) amount (with R ~ 1). While this is quite hard radiation at 
the LHC, its effect must be mitigated in top quark mass measurements; numerous methods 
have been introduced to groom jets so as to remove contaminating radiation in the jet [29- 
39]. However, at the ranges of pts accessible by a future collider, the jet px can be several 
to tens of TeV. In this regime, even emissions of a few GeV can change the jet mass by an 
0(1) amount. Thus, it is absolutely necessary to consider these effects at a future collider. 

One is therefore lead to consider whether grooming methods that have been successfully 
applied at the LHC to mitigate contamination from both FSR and ISR could be applied at 
higher energies and luminosities. So far the standard grooming techniques have not been 
studied in detail in such an extreme environment^ and so we will consider an alternative 
(simpler and possibly more robust) approach. 

To this aim, we note that the ISR effects scale like the jet radius to a positive power 
and the dead-cone effect suppresses FSR in a region of angular size rritopfpT- We can 
therefore reduce the contamination from radiation by appropriately scaling the jet radius 
by (the inverse of) its px- The specific procedure that we employ for boosted top quarks 


2 


Some studies of extreme pile-up or jet grooming at energies and luminosities beyond current LHC 


applications are [27, 28, 48-50]. 
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is the following. We first find jets with the anti-fc^ algorithm [51] and the Winner-Take- 
All (WTA) recombination scheme [50, 52-54] with a fixed jet radius, which we take to be 
R = 1.0. We then recluster the jet with the anti-fc^ algorithm where we set the subjet 
radius to: 

= (2.3) 

PT 

where (7 is a constant and px is defined by the original, fixed-radius, jet. We then keep only 
the hardest subjet found with this procedure and promote it to be the boosted top quark 
jet. With this prescription, the resulting jet is parametrically affected by contamination 
from ISR as 


-'mtop+P tPt 


ISR fCmtop\ 


PT 


) 




(2.4) 


whose effects are suppressed by the (small) ratio p^^/px- Similarly, because we choose the 
jet radius to scale with jet px, the dead-cone effect suppresses contamination from FSR 
over a fraction of the area of the jet that is independent of jet px- To ensure that the 
scaled radius jet captures the decay products of the top quark, in the remainder of the 
paper, we will adopt (7 = 4, corresponding to including above 95% of the energy fraction 
of the top quark decay products.^ We note that this coefficient should be optimised in 
an actual analysis. However, as our aim is to illustrate the principle, we prefer to choose 
a conservative value so to be sure to cluster as much radiation from the top quark decay 
products as possible. 

This approach is similar to variable R jets [45] in which the clustering metric for the 
kx class of algorithms [51, 56-60] is modified to be 


dij = min[pflJ,p^O]R|, 


diB — P^i j 

PTi 


(2.5) 


where /? is a dimensionful constant. The beam distance diB sets the effective jet radius to 
scale inversely with the px of the jet. In contrast to variable R jets, our procedure does 
not modify the jet algorithm. Because we hrst hnd a fat jet and then recluster to find 
the hardest subjet, our procedure effectively defines an infrared and collinear (IRC) safe 
seeded cone jet algorithm with a radius that scales inversely with px- 


2.2 Track-based observables 

Scaling the jet radius inversely with px decreases the amount of contamination radiation 
in the jet, though it does so at the cost of reducing the number of calorimeter cells in the 
detector that contribute to the jet. As a consequence, the calorimetric angular resolution 
is also reduced, though this effect was to be expected anyway because the angular size 
of the decay products decreases as the px increases. Here, we explore recovering high 

®Ref. [55] proposed a shrinking cone algorithm with a radius determined by the demanding that a fixed 
fraction of final state radiation was captured into the jet. 
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Figure 2: Distributions of the jet mass defined in three different ways in two pT bins: 2 — 5 
TeV (left) and 10 — 15 TeV (right). The red curves are the mass of jets with radius R = 1.0 
measured on tracks; orange cnrves are the mass of jets with scaled radins R = 4mtop/pT 
measured on charged tracks; green curves are the track mass of jets rescaled by the ratio 
of total jet pt to the px of tracks with the scaled jet radius R = Amtop/pT- 


angular resolution by measnring jet observables from charged particle tracks. We propose 
to use standard calorimetric information (together with tracks in the case of particle-flow 
algorithms) to determine the total energy of the jet and use tracking information only 
for its substructure. For observables like the jet mass, when measured on tracks, this 
systematically biases the mass to lower valnes because charged particles do not contain 
the full energy of the jet. However, the bulk of this bias can be removed and the jet mass 
restored to its “nominal” value by simply rescaling the track-based mass by the ratio of 
the total jet px to the px of its charged tracks.^ That is, in this paper we define the 
reconstructed jet mass as 

^ ~ ^tracks "^tracks , (2.6) 

Px 

where m-tracks is the mass as measured on charged tracks and transverse 

momentum of the tracks. We will also compare observables measured on current and 
projected future calorimetry to the track-based measurements. 

In Fig. 2, we plot the top quark jet mass distribution measured on charged tracks to 
illustrate the effectiveness of our procedure, for two jet energy ranges. For these plots, 
we have simulated pp —>• tt at leading order at 100 TeV center-of-mass energy with the 
MadGraph5_aMC@NLO 2.2.2 [64], Pythia 6.4 [65, 66] and Delphes 3.1.2 simulation 

^We thank Gavin Salam for suggesting this procedure. A related technique was employed in the charged- 
track version of the HEPTopTagger [61, 62] of Ref. [63]. 
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chain. Jets are clustered with Fast Jet 3.0.6 [67] using the anti-ZcT algorithm and WTA 
recombination scheme. We identify the jet in the event with the largest pT that lies in the 
appropriate bin. We plot results corresponding to three different jet mass definitions: the 
track jet mass measured on jets with fixed jet radius R = 1.0, the track jet mass measured 
on jets with radius that scales with pT as described earlier, and the rescaled track jet mass 
defined in Eq. (2.6). Both bare track jet masses are, as expected, systematically smaller 
than the true top quark mass. As the jet pT increases from a few TeV to over 10 TeV, 
the track mass as measured on fixed-radius jets significantly drifts to higher values, due to 
the increased contamination from ISR/FSR. On the other hand, the rescaled track mass 
measured on the scaled-radius jet peaks around the top quark mass, independent of the jet 
Pt bin. 

While the measurement of the jet mass is certainly a key observable to successfully 
identify a boosted top quark, it has to be kept in mind that large masses can also be 
generated by perturbative soft and collinear emissions in a QCD jet initiated by light 
quarks or gluons at high transverse momentum. In the collinear approximation, the average 
squared QCD jet mass is [68, 69] 

{m^) a,-plR\ (2.7) 

vr 

where ai is a constant that depends on the jet algorithm and is proportional to the colour 
of the initiating parton. While this provides a rule-of-thumb for the location of the peak, 
the QCD jet mass distribution is very wide. Nevertheless, at sufficiently high px with a 
fixed jet radius, QCD jets can have masses comparable to and even exceeding that of the 
top quark. This approximately occurs when 

( 2 . 8 ) 

^ TT 

or when px > 600 GeV, assuming R ~ Oi/vr ~ 1. Using our scaled jet radius procedure, 
the mass of QCD jets is instead modified to 

CV? (2.9) 

vr ^ 

independent of the jet px, but comparable to the mass of the top quark. Therefore a 
mass cut is not sufficient to efficiently reduce QCD backgrounds, and observables that are 
independent of the jet mass must be used. 

Particularly sensitive observables for boosted top quark identification are those that 
measure the prongy-ness of the jet. QCD jets dominantly consist of a single hard core of 
radiation, while hadronically decaying top quark jets typically have a 3-prong substructure. 
Several such observables have been proposed and studied [3-5]. In this work we employ 
the Wsubjettiness observables and the n-point energy correlation functions e^\ The 
A^-subjettiness observables are defined as 

= '^PTi min {Rfi,..., Rf^} , (2.10) 

ieJ 
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where the sum runs over particles in the jet J, pTi is the transverse momentum of particle 
i with respect to the beam, and Rix is the boost-invariant angle between particle i and 
subjet axis K, and /? > 0 is an angular exponent whose value can be used to control 
sensitivity to wide-angle radiation. In our study, we use the exclusive kT algorithm [56] 
with the WTA recombination scheme on the jet to define the N subjet axes. For the 
identification of 3-prong top quark jets, it has been shown [70, 71] that the ratio 


^3,2 ~ 


(/ 3 ) 


.(/ 3 ) 


( 2 . 11 ) 


is an efficient observable for top quark identihcation and is widely used in both ATLAS 
and CMS experiments. In our analysis, we will measure using only track information 
and on jets with a scaled jet radius, as described in Sec. 2 . 1 . 

The (dimensionless) n-point energy correlation functions are defined as (for n = 2,3,4) 

[72]: 


^2^^ " i PTiPTjR^j, 

Pt i<jej 

4 ^^ = -^ ^ PTiPTjPTk{RijRikRjk)^ , 

i<j<kej 

=—^ "PP PTiPTj PTkPTi {RijRikRiiRjkRjiRkP , ( 2 - 12 ) 

i<j<k<l&J 


where Rij is the boost invariant angle between particles i and j. Employing power count¬ 
ing of the soft and collinear regions of phase space. Ref. [73] showed that the particular 
combination 


n(“A,7) 





37 

/3 




-|- X 



27 

/3 




7 

a 


(2.13) 


for angular exponents a, /3 ,7 is the optimal observable for identihcation of 3-prong jets 
formed from the energy correlation functions. Here, 


X = Ki 


(p™t)2 


cut ^ 2 


m. 


y = 1^2 


top 


(PtI 

"Itop 


(^- 2 ^) 


(2.14) 


where p™* is a proxy for the pT bin of the jet sample of interest. We will take ki = ^2 = 1 for 
simplicity in the following, keeping in mind that varying their values may provide improved 
discrimination. We will hnd that and are complementary observables for 

boosted top identihcation. 


3 Detailed studies at fixed pt 

In this section, we present a detailed study of the methods introduced before in a restricted 
jet Pt range relevant for top quark identihcation at a future collider. The results quantify 
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(a) 



Migf [GeV/c^] 

(b) 


Figure 3: Distributions of the jet mass as measured on anti-fc^ jets with radius R = 1.0 
and pt £ [7.5,10] TeV on boosted top jets and QCD jets from light quarks and gluons, (a) 
Mass distribution as measured from the CMS detector’s calorimeter, (b) Mass distribution 
as measured from a future collider detector’s calorimeter. 


the discrimination power of our top quark tagging methods, show that projected calorime¬ 
try of a future collider is insufficient, and tracked-based information can provide the needed 
complementary information. App. B collects plots and analyses for a different set of jet pT 
bins demonstrating that our arguments have a wide range of validity. 

Here, we consider jets in the pT range of 7.5 — 10 TeV. The reason for this choice is the 
following. First, the pT is sufficiently hard that standard methods show clear limitations. 
The angular size of the top quark decay products in this bin is approximately 


Rtop ~ 


2mtop ^2-175 GeV 
PT ~ 7.5 TeV 


0.05, 


(3.1) 


which is comparable to the resolution of the electromagnetic calorimeters at ATLAS and 
CMS. Therefore, the reconstruction of top quark jets produced in this pT bin will be very 
sensitive to the resolution of the detector and can provide a valuable benchmark for future 
collider detector resolution goals. 

To do this, we will compare the efficiency to identify boosted top quark jets by using 
the standard calorimetric information only and by using our track-based method in a 
simulated CMS-like detector and in a projected future 100 TeV collider using Delphes. 
The parameters for the fast detector simulation are presented in App. A and provide, we 
believe, rather conservative estimates for the resolution of detectors of a future collider. 
All event simulation is done with the MadGraph5_aMC@NLO 2.2.2, Pythia 6.4 and 
Delphes 3.1.2 simulation chain. We study signal samples of boosted Z' —?• tt events. 
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Figure 4: Distributions of the jet mass as measured on anti-fc^ jets with radius R = 
4mtop/pT and px G [7.5,10] TeV on boosted top jets and QCD jets from light quarks and 
gluons, (a) Mass distribution as measured from the CMS detector’s calorimeter, (b) Mass 
distribution as measured from a future collider detector’s calorimeter. 


where the mass of the Z' is varied depending on the px of the top quarks, and background 
samples of the partonic SM processes qq —)• q'q' and gg gg aX a 100 TeV pp collider. Jets 
are reconstructed using the anti-A:^ algorithm with Fast Jet 3.0.6, with either a fixed jet 
radius or using the jet radius that scales with px, defined in Eq. (2.3). We ignore details 
and subtleties of defining quark flavor and gluon flavor jets, and pragmatically define them 
as what results from showering quark or gluon par tons. 

3.1 Mass Distributions 

We begin by presenting the distribution for the jet mass in these samples as measured 
from calorimeter cells with a fixed jet radius R = 1.0. Distributions of the signal and 
background jet masses for the CMS and future collider detectors are presented in Fig. 3. 
Because of the large jet radius, there is a significant contamination from radiation in the 
jets, resulting in all mass distributions, either at CMS or a future collider, peaking at 
masses substantially greater than the top quark mass. Additionally, the light quark and 
gluon background distributions straddle the signal top quark distribution, showing that 
any background QCD jet sample would be essentially indistinguishable from a top quark 
sample from these mass distributions alone. 

In Fig. 4, we present the distributions of the jet mass as measured on calorimeter 
jets with a scaled jet radius R = Amtop/px- With the scaled jet radius, the amount 
of contamination in the jet is greatly reduced; however, the low resolution degrades the 
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Figure 5: Distributions of the rescaled track-based jet mass as measured on anti-Zcr jets 
with radins R = 4mtop/pT and pT G [7.5,10] TeV on boosted top jets and QCD jets from 
light quarks and gluons, (a) Distribution as measnred from the CMS detector’s tracking 
system, (b) Distribution as measured from a future collider detector’s tracking system. 


mass distributions measured in the CMS detector. Signal and background distributions 
overlap and all samples have a peak at zero mass indicating that the jet consists of a single 
calorimeter cell. Detector resolution effects are also significant in the mass distributions 
measured at a future collider. While the angular resolution of the calorimeter for the future 
collider scenario is x2 finer than for CMS (see App. A. 2) and so these distributions do not 
have a peak at zero mass, the overlap of signal and backgronnd is snbstantial, and a cut 
on the mass would only result in a marginal top quark tagging efficiency. 

Finally, in Fig. 5, we present the distributions of the jet mass as measured from tracks 
with the jet radius R = dmtop/pr and rescaling by the ratio of the total jet pT to the track 
Pt- Unlike the jet mass as measured from the calorimeter, the rescaled track mass accu¬ 
rately reproduces the top mass for both the CMS and future collider detectors and barely 
suffers from resolution effects. Additionally, the mass distributions of QCD backgrounds 
are pushed to small values, below the mass of the top quark for both detectors. A cut in 
a window around the top quark mass would therefore robustly and efficiently discriminate 
boosted top quark jets from the QCD background. Larger rejection of QCD background 
can be accomplished by measuring additional snbstrncture observables on the jet. 

3.2 Substructure observables 

As mentioned earlier, a jet mass comparable to the mass of the top quark can be recon¬ 
structed in several ways: for signal, by the decay into hard subjets and no FSR radiation; 
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while for background by the emission of signihcant amounts of soft radiation in the jet. For 
improved rejection of the background, in addition to the mass determination, observables 
on the jet that are sensitive to the 3-prong substructure of the top decay must to be mea¬ 
sured. We therefore consider Wsubjettiness ratio and the energy correlation function 
ratio While a full study of the efficiency as a function of the angular exponents 

and other parameters in these observables may result in improved discrimination power, 
we choose to use the parameters for boosted top tagging as recommended by the original 
investigations. That is, we consider the observables 

T5,2 — T!,2 > -^3 — T'g , i3.Zj 

as measured on jets, which lie in the rescaled track-based mass window mj G [120, 250] 
GeV in the following. Even without optimisation of parameters, we find impressive signal 
to background efficiency rates for jets produced at a 100 TeV collider. 

In Fig. 6 , we present the distributions of r 3^2 and D 3 measured on signal and back¬ 
ground jets. We only consider these observables as measured on tracks and with jet radius 
that scales inversely with pT- Though we only measure on tracks, because both and 
are dimensionless, we do not rescale them by the ratio of the total jet pT to the charged 
track pt- Both r 3^2 and ZI 3 provide excellent separation of signal and background jet sam¬ 
ples, and in a complementary way. For the top quark distribution overlaps with the 
light quark distribution more than it does with gluons, while for ZI 3 the opposite occurs. 
While we do not have a complete theoretical understanding of this behaviour, this suggests 
that simultaneous measurement of t ^^2 and II 3 on jets would provide further discrimination 
power, depending on the light quark and gluon composition of the background sample. 

To emphasise the discrimination power provided by ts ^2 and D 3 , in Fig. 7 we show 
signal versus background efficiency curves (ROC curves) produced by making a sliding 
cut in either observable. As we require jets to lie in the rescaled track mass window 
m G [120, 250], the efficiencies in Fig. 7 include the effect of the mass cut. As anticipated, 
measuring these observables on tracks provides significant improvement in discrimination 
power as compared to including calorimeter information alone. Depending on the quark 
or gluon composition of the background sample, r 3,2 or D 3 measured on tracks provides 
better discrimination power. For example, at a future collider in this pT bin at 50% top 
quark jet efficiency, 83% of gluon jets can be rejected by cuts on the mass and T 3 ^ 2 , while 
94% of light quark jets can be rejected by a mass cut and D 3 . Thus, these observables 
exhibit a nice complementarity in how they can reject QCD background. 

We stress that for the distributions in Fig. 6 and their corresponding ROC curves in 
Fig. 7, no optimisation has been performed. These observables have been applied “out of 
the box”, using the values for their parameters as recommended in the original studies. 
Therefore, it may be possible to substantially improve boosted top quark discrimination 
power by optimising the free parameters that enter the definition of the observables. Nev¬ 
ertheless, even with this naive application of a cut on the track mass and cuts on 73^2 or D 3 
impressive discrimination can be obtained. Additionally, while 73^2 and D 3 are both sensi¬ 
tive to the 3-prong substructure, their relative performance rejecting gluon and light quark 
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Figure 6: Distributions of the jet r 3 /r 2 (top) and (bottom) as measured on anti-fer 
jets with radius R = 4mtop/pr and pT £ [7.5,10] TeV on boosted top jets and QCD 
jets from light qnarks and gluons. Additionally, we require that the rescaled track-based 
mass lie in the window m G [120, 250] GeV. (left) Distributions as measured from the 
CMS detector’s tracking system, (right) Distributions as measured from a future collider 
detector’s tracking system. 

jets is not well understood. The motivation for constructing these observables relies on 
the behavior of QCD in the parametric soft and collinear limits. However, gluon and light 
quark jets are not parametrically different objects, and so, without detailed calculation, 
it may not be possible to understand the performance. Further complication for detailed 
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Figure 7: Signal vs. background efficiency (ROC) curves for top quark identification from 

QCD background utilising r 3^2 and D^: (left) top quarks vs. gluon jets, (right) top quarks 

vs. light quark jets. The ROC curves as measured in the calorimeter are dashed lines and 

as measured on tracks are solid lines for (top) the CMS detector and (bottom) the FCC 

detector. The cut on the jet mass of m G [120, 250] GeV is included in the efficiencies. 


theoretical understanding occurs because we only measure these observables on charged 
tracks. Quark and gluon partons fragment to charged hadrons differently, and this may 
also affect the discrimination power. A detailed study of these effects and their impact on 
discrimination power employing track-based methods of Refs. [74-76] would be certainly 
welcome, even though clearly beyond the scope of this paper. 

4 Conclusions 

We summarize the results of this paper in Fig. 8, illustrating the potential discrimination 
power for identifying boosted top quarks at the detector of a future high energy proton 
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Figure 8: Plots illustrating the efficiency for tagging top quarks and rejecting QCD back¬ 
ground as a function of jet pT using tracking (blue, red) versus calorimetry (light blue, 
magenta) at a future collider. For identifying three-prong substructure the observables r 3^2 
and L >3 have been used, (a) Top tagging efficiency for a fixed light quark and gluon mistag 
rate of 5%. (b) Light quark and gluon mistag rate at hxed top quark efficiency of 50%. 
The bands represent the envelope of efficiencies spanned by the Monte Carlo simulations 
(Herwig 6 and Pythia 6.4) that we use. 


collider modeled with Delphes. On the left, we show the hadronically-decaying top quark 
tagging efficiency as a function of jet pT at fixed mistag rate for jets produced from light 
quarks and gluons of 5% comparing our method using tracking versus using calorimetry 
exclusively. The bands represent the envelope of efficiencies from using either Herwig 6 
[77-79] or Pythia 6.4 Monte Carlo simulations. On the right, we plot the efficiency for 
mis-tagging jets initiated by light quarks or gluons as top quarks at fixed signal efficiency 
of 50%. Our procedure enables significant rejection rates at pts approaching 20 TeV, 
while using calorimetry alone struggles to reject more background than signal. Exploiting 
tracking enables impressive signal efficiency, comparable to that of taggers used at the 
LHC, whose performance is relatively independent of jet pT and extends well beyond 10 
TeV. 

In this paper, we have presented a procedure for the identification of top quarks in 
the multi-TeV energy range as those that could be produced at a future 100 TeV proton 
collider. High-resolution tracking information was required for identification of the prongs 
produced in the top quark decay and contamination due to initial- and final-state radiation, 
underlying event, pile-up, or other sources can be reduced signihcantly by dynamically 
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scaling the jet radius inversely proportional to the transverse momentum of the jets. By 
applying a simple jet selection on the track-based jet mass, A^-subjettiness ratio, and 
energy correlation function based jet substructure observables, very high efficiency for 
discriminating boosted top quarks from light QCD jets can be accomplished, and with 
rates that are relatively independent of the boost. These results are encouraging for the 
prospects of precision studies of the electroweak sector at a future collider. 

Though we have employed a fast detector simulation, we believe that the analysis 
presented here can provide a useful benchmark for performance and response of detectors 
at a future collider. The relatively conservative parameters in the fast detector simulation 
should provide a realistic goal for the resolution of a future detector. Nevertheless, there 
were several effects not included in this analysis. For example, we have not considered 
backgrounds to boosted top quarks from electroweak boson emission from light quarks. 
The main reason is that vector bosons in energetic jets are typically soft [80, 81] and 
therefore when decaying hadronically their effect is very similar to that of QCD radiation 
yet subleading due to the weak coupling. However, they may be relevant in high purity (low 
signal efficiency) samples. In addition, while rescaling the jet radius inversely proportional 
to the transverse momentum and using charged tracks reduces contamination from pile-up, 
we did not include any simulation of pile-up in our analysis. The effects of pile-up on the 
efficiency for tagging top quarks would be important to understand in a dedicated analysis, 
but we believe that the effects would be minimal because we use a recoil-free jet algorithm 
as well as scaling the jet radius. 

Besides these experimental issues, our procedure for tagging highly boosted top quarks 
raises some interesting theoretical questions. Because of our prescription for scaling the 
jet radius, the angular size of the jets we consider becomes very small at sufficiently high 
transverse momenta. By restricting the radiation in the jet to such small angular regions, 
non-global logarithms [82], clustering logarithms [83], or logarithms of the jet radius itself 
[84-86] could become important for a theoretical understanding. For high angular resolu¬ 
tion, we also require measurements on charged tracks, and not on all radiation in the jet. 
Therefore, a model of the fragmentation of partons to charged hadrons is required to pre¬ 
dict the distributions of the jet substructure observables on these jets. An understanding of 
track-based observables [74-76] may also explain the difference that we observed between 
the performance of 73^2 and D 3 on light quark or gluon jets and could potentially predict 
the optimal parameters of those observables for discrimination. 

We also emphasize that the methods presented here apply more broadly than to top 
quarks alone. Studies for the identihcation of hyper-boosted 2-prong jets from hadronic 
decays of W, Z, or H bosons can benefit from these techniques. Contamination radiation 
can be controlled using the scaled jet radius, though, because these objects are colour- 
singlets and narrow resonances, unlike for top quarks, there is no contaminating FSR 
produced. Because the masses of the electroweak bosons are smaller than the top, using 
the high resolution of the tracker becomes even more important, as the structure of these 
jets lies at smaller angular scales. For identification of 2-prong substructure, instead of 73^2 
and D 3 , one would use observables such as r 2 ,i Wsubjettiness ratio and D 2 formed from 
the energy correlation functions [87]. 
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Looking forward to a future hadron collider during the era of the LHC provides context 
and motivation for collider physics studies in extreme environments. Additionally, analyses 
at the LHC itself can benefit from efforts for a future collider. For example, while the 
energies and luminosities at the LHC are an order of magnitude smaller than a proposed 
future collider, our proposal for jet radius rescaling and track-based measurements could 
be useful for analyses at the LHC in a high pile-up environment or over a large energy 
range. Looking forward to Run 2 at the LHC, a new, higher energy regime will be explored 
and will require the use of new techniques to push forward. 
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A Parametrised detector simulation 

The ability to disentangle the structure of jets is largely dependent on detector resolution 
effects, such as the angular and energy-momentum resolution of the jet constituents. In 
addition, the finite geometrical acceptance of the detector, and the presence of a magnetic 
field can have an influence on the measurement of jet properties. Such important detector 
effects have been taken into account by means of the fast simulation framework Delphes 
[46]. For this study, two detector setups have been used: the GMS detector [88], as a 
reference for present performance, and a hypothetical detector for a future circular collider. 
The FCC detector measures particles created from pp collisions with ^/s = 100 TeV. Since 
no prototype of such a detector currently exists yet, the performance of existing detectors 
needs to be extrapolated to future scenarios. We will briefly describe the modeling of the 
detector response for these two configurations in this appendix. 

A.l Tracking 

After collision, parton showering, hadronisation, and decays, the first step of DELPHESis 
the propagation of long-lived particles inside the tracking volume within a uniform axial 
magnetic field parallel to the beam direction. The magnetic field strength B, the size of 
the tracking radius L and the single hit spatial resolution anf, are the main parameters that 
constrain the resolution on the track transverse momentum: 





Figure 9: Tracking efficiency as a function of the transverse distance between the track 
direction and the jet center. 


As a benchmark, we assume that the radius of the FCC inner detector is twice that of 
CMS and that the magnetic field is increased by a factor of 3/2. We also assume that the 
spatial resolution is two times smaller than at CMS, which is possible by designing a 
more granular pixel detector [89]. These improvements in an FCC detector would allow 
measurements of 1 TeV charged hadrons with a precision of (t{pt)/pt — 1 — 2%. 

In Delphes, charged tracks are reconstructed assuming an infinite angular resolution, 
which is a good approximation for moderately boosted objects, but not optimal for the 
energy regime that is considered in this study. A module that performs a smearing on the 
track direction was therefore developed specifically for this study. We assume an overall 
factor 2 with respect to present CMS performance, which is mainly driven by a more 
granular detector, and can eventually improved further by increasing the number a tracker 
layers, which is possible thanks to a larger tracker volume. This leads to a conservative 
estimate for the angular resolution of the FCC tracker detector a{9, (p) ~ 10“^ rad. 

Accounting for the hnite angular resolution on the track direction is not enough in 
a high occupancy regime. Charged particles confined inside a highly boosted jet can be 
extremely collimated, resulting in unresolvable tracker hits, especially in the innermost 
tracking layers. Although an accurate description of this feature would require a full event 
reconstruction by means of a GEANT-based simulation [90], we construct a simple model of 
this effect by parametrising the track reconstruction efficiency as a function of the transverse 
distance between a track and the jet center. The jet center has the highest density of 
charged particles, and so this should describe the dominant effect on the resolution. For 
tracks a distance R from the jet center, we define the track resolution efficiency 

~ arctan , (A.2) 

which is plotted in Fig. 9. 


- 20 





CMS 

FCC 

Bz (T) 

3.8 

6.0 

Length (m) 

6 

12 

Radius (m) 

1.3 

2.6 

eo 

0.90 

0.95 

R* 

0.002 

0.001 


0.2 • pt (TeV/c) 

0.02 - PT (TeV/c) 

a{rj,(t>) 

0.002 

0.001 


Table 2: Tracking-related parameters for the CMS and FCC setup in Delphes. 



CMS 

FCC 

a{E)/E (ECAL) 

7%/\/E©0.7% 

3%/VE © 0.3% 

a{E)/E (HCAL) 

150%/VE © 5% 

50%/VE © 1% 

rf X (j) cell size (ECAL) 

(0.02 X 0.02) 

(0.01 X 0.01) 

rf X (f) cell size (HCAL) 

(0.1 X 0.1) 

(0.05 X 0.05) 


Table 3: Calorimeter parameters for the CMS and FCC setup in Delphes. 


Inspired from CMS public tracker performance [91], we choose cq = 90% (95%) and 
R* = 0.002 (0.001) for the CMS (FCC) detector. We stress that in principle the tracking 
angular resolution depends on the track momentum and pseudorapidity, but for simplicity 
we assume a constant (conservative) value. For a robust determination of the jet center in 
the case of light quarks and gluon jets we rely on the WTA recombination scheme. On the 
other hand, given that top quark jets often consists of three hard prongs, parametrising 
the efficiency as a function of the distance between the track and the jet center would 
result in an underestimate of track losses for tracks that belong to the subleading prongs. 
Therefore, in order to realistically simulate track losses inside top jets we first run the N- 
jettiness clustering algorithm [71, 92] at particle level to help us identify the three dominant 
subjet axes. We then require tracks to pass the efficiency criterion from Eq. (A.2) with 
respect to each subjet core. A summary of all the tracking parameters used for the tracking 
parametrisation in DELPHESis given in Table 2. 

A.2 Calorimetry 

After propagating within the magnetic field, long-lived particles reach the electromag¬ 
netic (ECAL) and hadronic (HCAL) calorimeters. Since these are modeled in DELPHESby 
two-dimensional grids of variable spacing, the calorimeter deposits natively include finite 
angular resolution effects. In order to accurately model the angular resolution on recon¬ 
structed jets, separate grids for ECAL and HCAL has been adopted for this study. The 
ECAL and HCAL maps for the CMS detector are taken from Ref. [88]. 
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20% Top Efficiency 


Pt cut 

[2.5, 5] TeV 

[5, 7.5] TeV 

[7.5,10] TeV 

[10,15] TeV 

[15, 20] TeV 

gluons 

CMS 

2 % 

3% 

4% 

5% 

6 % 

FCC 

1 % 

2 % 

2 % 

3% 

4% 

quarks 

CMS 

1 % 

2 % 

3% 

5% 

7% 

FCC 

0.5% 

1 % 

1.5% 

2 % 

4% 




40% Top Efficiency 




Pt cut 

[2.5, 5] TeV 

[5, 7.5] TeV 

[7.5,10] TeV 

[10,15] TeV 

[15, 20] TeV 

gluons 

CMS 

7% 

9% 

10 % 

14% 

17% 

FCC 

5% 

6 % 

7% 

10 % 

12 % 

quarks 

CMS 

3% 

5% 

7% 

11 % 

17% 

FCC 

1.5% 

2.5% 

4% 

5% 

8 % 




60% Top Efficiency 




Pt cut 

[2.5, 5] TeV 

[5, 7.5] TeV 

[7.5,10] TeV 

[10,15] TeV 

[15, 20] TeV 

gluons 

CMS 

18% 

20 % 

24% 

30% 

38% 

FCC 

13% 

15% 

20 % 

24% 

25% 

quarks 

CMS 

7% 

10 % 

15% 

22 % 

30% 

FCC 

4% 

6 % 

8 % 

11 % 

15% 


Table 4: Table of background rejection rates at fixed signal efficiencies for jet ranging 
from 2.5 TeV to 20 TeV at the CMS or FCC detector. For gluon (quark) jet backgrounds, 
efficiencies are determined from cuts on {D^) measured on tracks. The cut on the 
rescaled track-based jet mass of mj G [120, 250] GeV is included in the efficiencies. 


For the FCC detector we assume that the same elementary calorimeter cells will be 
used, but placed at twice the CMS distance from the interaction point, leading to an 
improved angular resolution by a factor 2 . In order to obtain the best possible energy reso¬ 
lution, the best nominal performance between the CMS [ 88 ] and ATLAS [93] calorimeters 
was chosen. The calorimeter performance for the CMS and FCC setups are summarised in 
Table 3. 

B pt scan and resnlts 

In this appendix, we present the results of our top quark identification procedure over a 
wide range of jet pts at a 100 TeV future collider. A discussion of the physics or analysis 
of the results will be limited, as these were discussed in detail in the text. In Figs. 10, 11 
and 12 , we plot the jet mass, T 3^2 and distributions as measured on calorimeter jets 
with radius R = 4mtop/pr- As the jet pT increases, finite resolution effects dominate, and 
especially in the CMS detector for jets in the highest pT bin, a large fraction of the jets 
consist of a single calorimeter cell. In Figs. 13, 14 and 15, we plot the rescaled track-based 
mass, T 3^2 and ZI 3 distributions on charged tracks for jets with a radius R = 4mtop/pr for 
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both signal and background jets. Figs. 16 and 17 show the signal vs. background efficiency 
curves for a simulated CMS or FCC detector, respectively, at a future 100 TeV collider. In 
these figures, we compare the discrimination power of the jet substructure observables r 3^2 
and L >3 over a wide range of jet pts as measured on calorimeter or track-based jets. As with 
the pt G [7.5,10] TeV bin studied in Sec. 3, discrimination power is significantly improved 
by working with track-based observables. In addition, is a better discriminant over a 
wide range of signal efficiencies than ZI 3 for background gluon jets, but the improvement 
decreases as pT increases. ZI 3 , on the other hand, is a much more powerful observable for 
rejecting light quark background jets than r 3 ^ 2 - These features are further illustrated in 
Table 4 where we list the rejection rates (= 100% — background efficiency) for benchmark 
signal efficiencies, for jets with pts ranging from 2.5 to 20 TeV. 
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Figure 10: Distributions of the jet mass (top), T^jr^ (middle), and (bottom) as 
measured on anti-fc-r jets with radius R = 4mtop/pT and pT G [1.0, 2.5] TeV on boosted 
top jets and QCD jets from light quarks and gluons, (left) Distributions as measured from 
the CMS detector’s calorimeter system, (right) Distributions as measured from a future 
collider detector’s calorimeter system. 
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Figure 11: Distributions of the jet mass (top), T^jr^ (middle), and D 3 (bottom) as 
measured on anti-fc-r jets with radius R = 4mtop/pT and pT G [5.0, 7.5] TeV on boosted 
top jets and QCD jets from light quarks and gluons, (left) Distributions as measured from 
the CMS detector’s calorimeter system, (right) Distributions as measured from a future 
collider detector’s calorimeter system. 
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Figure 12: Distributions of the jet mass (top), T^jr^ (middle), and D 3 (bottom) as 
measured on anti-A:^ jets with radius R = 4mtop/pT and pT G [15,20] TeV on boosted 
top jets and QCD jets from light quarks and gluons, (left) Distributions as measured from 
the CMS detector’s calorimeter system, (right) Distributions as measured from a future 
collider detector’s calorimeter system. 
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Figure 13: Distributions of the rescaled track-based jet mass (top), TzjT^ (middle), and 
D 3 (bottom) as measured on anti-Zcy jets with radius R = 4mtop/pT and pT G [1.0, 2.5] 
TeV on boosted top jets and QCD jets from light quarks and gluons, (left) Distributions 
as measured from the CMS detector’s tracking system, (right) Distributions as measured 
from a future collider detector’s tracking system. 
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Figure 14: Distributions of the rescaled track-based jet mass (top), TzjT^ (middle), and 
D 3 (bottom) as measured on anti-Zcy jets with radius R = 4mtop/pT and pT G [5-0, 7.5] 
TeV on boosted top jets and QCD jets from light quarks and gluons, (left) Distributions 
as measured from the CMS detector’s tracking system, (right) Distributions as measured 
from a future collider detector’s tracking system. 
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Figure 15: Distributions of the rescaled track-based jet mass (top), TzjT^ (middle), and 
D 3 (bottom) as measured on anti-Zcr jets with radius R = 4mtop/pT and pr G [15,20] 
TeV on boosted top jets and QCD jets from light quarks and gluons, (left) Distributions 
as measured from the CMS detector’s tracking system, (right) Distributions as measured 
from a future collider detector’s tracking system. 
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Figure 16: Signal vs. background efficiency (ROC) curves for top quark identification from 
QCD background utilising and ZI 3 with the CMS detector for three pT bins: [1.0, 2.5] 
TeV (top), [5.0, 7.5] TeV (middle), [15,20] TeV (bottom), (left) top quarks vs. gluon jets, 
(right) top quarks vs. light quark jets. The cut on the jet mass of m G [120, 250] GeV is 
included in the efficiencies. 
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Figure 17: Signal vs. background efficiency (ROC) curves for top quark identification from 
QCD background utilising and with the FCC detector for three pT bins: [1.0, 2.5] 
TeV (top), [5.0, 7.5] TeV (middle), [15,20] TeV (bottom), (left) top quarks vs. gluon jets, 
(right) top quarks vs. light quark jets. The cut on the jet mass of m G [120, 250] GeV is 
included in the efficiencies. 
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